Testing for arbitrary interference on experimentation platforms∗
نویسندگان
چکیده
Experimentation platforms are essential components of modern large IT companies, as they are used to carry out a large number of randomized experiments daily. On such platforms, the classic assumption of no interference among users—that is, the fact that the outcome of a user does not depend on the treatment assigned to other users—is rarely tenable. Here, we introduce an experimental design strategy for testing whether this assumption holds. Our approach is in the spirit of the Durbin-Wu-Hausman test for endogeneity in econometrics, where multiple estimators return the same estimate if and only if the null hypothesis holds. The design that we introduce makes no assumptions on the interference model between units, nor on the network among the units, and comes with a sharp bound on the variance, and an implied analytical bound on the type I error rate. The idea is to obtain two different estimates of the total treatment effect leveraging two randomization strategies—namely, complete randomization and clusterbased randomization—and to analyze the distribution of their difference. We propose a multilevel assignment strategy for obtaining these two estimates simultaneously, and we develop theoretical guarantees for rejecting the null hypothesis that no interference holds, regardless of the type of interference. We discuss how to apply the proposed design strategy to large experimentation platforms, and we illustrate it in the context of a live experiment on the LinkedIn experimentation platform.
منابع مشابه
THE USAGE OF ARTIFICIAL NEURAL NETWORKS IN HYDRODYNAMIC ANALYSIS OF FLOATING OFFSHORE PLATFORMS
Floating offshore structures, particularly floating oil production, storage and offloading systems (FPSOs) are still in great demand, both in small and large reservoirs, for deployment in deep water. The prediction of such vessels’ responses to her environmental loading over her lifetime is now often undertaken using response-based design methodology, although the approach is still in its...
متن کاملModel-Based Testing as a Service for IoT Platforms
The Internet of Things (IoT) has increased its footprint becoming globally a ’must have’ for today’s most innovative companies. Applications extend to multitude of domains, such as smart cities, healthcare, logistics, manufacturing, etc. Gartner Group estimates an increase up to 21 billion connected things by 2020. To manage ’things’ heterogeneity and data streams over large scale and secured d...
متن کاملOptimal Testing for Crowd Workers
Requesters on crowdsourcing platforms, such as Amazon Mechanical Turk, routinely insert gold questions to verify that a worker is diligent and is providing high-quality answers. However, there is no clear understanding of when and how many gold questions to insert. Typically, requesters mix a flat 10–30% of gold questions into the task stream of every worker. This static policy is arbitrary and...
متن کاملPerformance Evaluation of Space-time Coding on an Airborne Test Platform
Typical airborne test platforms use multiple telemetry transmit antennas in a top and bottom configuration in order to mitigate signal shadowing during maneuvers on high dynamic platforms. While mitigating one problem, this also creates a co-channel interference problem as the same signal, time delayed with differing amplitude, is sent to both antennas. Space-Time Coding (STC) was developed wit...
متن کاملWeb Control and Monitoring System: Experimentation with Haematococcus Pluvialis (TECHNICAL NOTE)
This paper presents both, the design and the development of a monitoring and control system via web for a closed microalgae crop and the results that were gotten using the strain Haematococcus pluvialis. The research was done at Sabana University (Colombia) and it aims to quantify the kinetic growth associated to the increment of biomass and the development of red pigment inside the cells when ...
متن کامل